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Abstract: Mitochondrial genome sequence of malaria parasites has served as a potential marker for inferring evolutionary 
history of the Plasmodium genus. In Plasmodium falciparum, the mitochondrial genome sequences from around the globe 
have provided important evolutionary understanding, but no Indian sequence has yet been utilized. We have sequenced 
the whole mitochondrial genome of a single P. falciparum field isolate from India using novel primers and compared with 
the 3D7 reference sequence and 1 previously reported Indian sequence. While the 2 Indian sequences were highly diver- 
gent from each other, the presently sequenced isolate was highly similar to the reference 3D7 strain. 
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Malaria is a vector-bome infectious disease, endemic to many 
of the tropical and subtropical countries of the globe including 
India. Approximately half (273 million) of the high risk popu- 
lation outside Africa resides in India [ 1 ] . Plasmodium falciparum 
malaria is the leading cause of deaths and accounts for about 
50% of malaria cases in India [2]. The problem is further com- 
pounded by high virulence and emergence of drug resistance 
in P. falciparum, and till now no effective vaccine is available. 
The primary huddle to design an effective vaccine that would 
work in all malaria endemic populations is highly observed ge- 
netic diversity in P. falciparum field isolates, as this parasite uses 
the genetic diversity to fight against the anti-malarial drugs and 
host immunity [3]. Therefore, the analysis of within-species 
genetic diversity is very important for understanding evolution- 
ary processes both at the population and the genetic level which 
will not only enlightens the origin, historical migration, and 
demography of different populations, but also inform if new 
parasite genotypes of high virulence and drug resistance are 
emerging and spreading to different populations. Such under- 
standing on the long run will definitely be of help in devising 
effective population-based control measures. 
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To this respect, mitochondrial {mi) genome has served as an 
ideal marker to understand evolutionary history of many mod- 
el and non-model organisms and has been a marker of choice 
for reconstructing historical patterns of population demogra- 
phy and phylogenetic studies [4]. Several characteristics features; 
such as high mutation rate [5], maternal inheritance [6], and 
lack of recombination [7] have made the mt genome an ideal 
extra-nuclear genome to reconstruct evolutionary histories of 
the species. In malaria parasites, mt genome is of particular rel- 
evance, due to (i) its small size (~6 kb), (ii) haploid, and (iii) 
contains 3 protein-coding genes, cytochrome c oxidase I (coxl), 
cytochrome c oxidase III (coxffl) and cytochrome b (cytb) [8]. 

All these 3 genes are essential for a range for cellular pro- 
cesses; like membrane potential maintenance, heme and co- 
enzyme Q biosynthesis, and oxidative phosphorylation [9]. 
Most importantly, the cytb gene of P. falciparum mitochondria 
is a potential target for an antimalarial drug, atovaquone [10]. 
Moreover, the mt genome of parasites evolves neutrally and 
shows no signs of recombination or selection [11], hence the 
whole genome behaves as a single locus and all sites share a 
common genealogy, which makes it ideal for studying within- 
species variations and phylogenetic analysis. While mt genome 
sequences have been reported from many malaria endemic 
countries of the world [11], only 1 complete mt genome se- 
quence of P. falciparum isolate originating from unknown loca- 
tion in India has so far been reported [12], Considering P. fal- 
ciparum malaria is widespread in India, lack of mt genome se- 
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quence information from multiple Indian isolates has debarred 
us in unravelling the net diversity of Indian P. falciparum. To 
fill this gap in knowledge and to initiate population genomic 
studies of Indian P. falciparum, we herewith report the whole 
mi genome sequence of a single Indian field isolate, and the 
results of preliminary comparative genomic studies between 
the existing Indian data [12] and the reference sequence (3D7 
isolate). 

Finger-pricked blood sample of a microscopically diagnosed 
P. falciparum infected malaria patient from Bilaspur (Chhattis- 
garh state, India) was spotted (2-3 spots) on Whatmann filter 
paper, dried, and brought to the laboratory in New Delhi. Ge- 
nomic DNA was extracted from these dried blood spots using 
QIAamp mini DNA isolation kit (Qiagen, Hilden, Germany). 
Since both P. falciparum and Plasmodium vivax are endemic to 
India, we used the isolated genomic DNA to perform PCR di- 
agnostic assay to identify mixed infection following the nested 
PCR with genus and species-specific primers based on 18S 
rRNA gene [13]. The study was approved by the Ethics Com- 
mittee of the National Institute of Malaria Research (NIMR), 
India, and written informed consent of the patients have been 
obtained. 

To sequence the whole mt genome of the present P. falciparum 
isolate originated from Bilaspur (refened as Blspl), we checked 
for PCR amplification using the sets of primers reported to se- 
quence the Indian P. falciparum isolate PfPHlO [12]. However, 
we could not PCR-amplify any of the fragments of the mt ge- 
nome of the Blspl isolate with repeated attempts. Therefore, we 
used the whole mt genome sequence information of the pub- 
lished 3D7 isolate, downloaded from the NCBI (www.ncbi. 
nlm.nih.gov/) database with GenBank accession no. AY282930, 
and chopped the whole mt genome into 19 different DNA 
fragments. In order to sequence the whole mt genome of P. fal- 
ciparum Bslpl isolate, we designed 19 novel primer-pairs. Two 
online computer programs, Primer3 and SIGMA primer-calcu- 
lator were used to design these novel primer-pairs so as to keep 
the length of each fragment below 600 nucleotide base pairs 
(bp) with ~ 150 bp of overlapping sequences between each ad- 
joining fragment. The length of each of the 19 PCR-amplified 
fragments was deliberately kept below 600 bp, as we have per- 
formed DNA sequencing following Sanger technology (see be- 
low). 

All PCR amplification reactions were carried out in a final 
volume of 25 pi, which included 1 pi of each primer (10 pmol/ 
yd), 0.2 mM of dNTP, 1 unit of Taq DNA polymerase (Merck) 



with 2.5 pi of lx Taq DNA polymerase buffer and 1 pi of DNA. 
Amplifications were performed with the following cycling 
conditions: 95°C for 5 min, then 35 cycles of 1 min denatur- 
ation at 95°C, 1 min annealing at different temperatures for 
different DNA fragments (Table 1), 1 min extension at 72°C 
followed by 5 min final extension at 72°C. Successfully ampli- 
fied PCR products were further purified by incubating with 

Table 1 . Details of primer sequences employed in PCR amplifica- 
tion and sequencing of the whole mt genome of Plasmodium fal- 
ciparum along with annealing temperatures of the respective frag- 
ments 



S. no. 


Primer 
name 


Primer sequence (5' to 3') 


Annealing 
temperature 


1 


M 


JF 


TGCTATTGGATTCAACGTCC 


63.7 




M. 


JR 


GTCCTGCATGMCGGTGTA 




2 


M 


_2F 


TCGTAACCATGCCAACACAT 


63.2 




M. 


_2R 


GCTGGGCATTTAATCCACTC 




3 


M 


_3F 


GGGTATCCAATCCAGTGCTC 


63.7 




M. 


_3R 


CAAACACTAGCGGTGGAACA 




4 


M 


_4F 


AGGGAACAAACTGCCTCAAG 


63 




M. 


_4R 


GGCATTTTGTTGAAATAGTCTGG 




5 


M 


_5F 


ACTTCCTTTCTCGCCATTTG 


63.7 




M. 


_5R 


GCATCATGTATGAGTGCATGTT 




6 


M 


_6F 


TTGTAGAGATGCAAAACATTCTCC 


60.4 




M. 


_6R 


GCACATCTAGTTTCATATCCTGCA 




7 


I VI. 


7r 

_/r 


CAGAATAAAAACTTTCTCGAATAGG 


61.8 




M. 


_7R 


AAGTACGCGATCTCTTGTATGG 




8 


M 


_8F 


CGCAGCCTTGCAATAAATAA 


61.8 




M. 


_8R 


L/MI bAbuL/ 1 L^IjA 1 A 1 AAA 1 IjA 




9 


M 


_9F 


GAACGCTTTTAACGCCTGAC 


52.7 




M. 


_9R 


AGTCCATCCAGTTCCACCAC 




10 


M_ 


10F 


CCAGGATTATTCGGAGGATT 


52.8 




M. 


10R 


CAGGATGTCCAAAATACCAGA 




11 


M_ 


.11 F 


CCGGTTTTAACTGGAGGAGT 


62.6 




M_ 


1 1 R 


GCTACATCAATGGCAGCAT 




12 


M. 


12F 


CCGGTACAAAAGTATTTAACTGGA 


62.6 




M_ 


12R 


GGTCATTGTTGTCCCAATAGAA 




13 


M_ 


13F 


GCATTTCAAGATAATTTCTTTGGT 


62.6 




M_ 


13R 


AAACATCTGGTGTATATCGACTTG 




14 


M_ 


14F 


CACACTTAATAAATTACCCATGTCCA 


62.6 




M„ 


14R 


GGATCACTCACAGTATATCCTCCA 




15 


M_ 


15F 


TTGTCTTACCATGGGGTCAAA 


62.6 




M_ 


15R 


CCAGCTGGTTTACTTGGAACA 




16 


M. 


16F 


TCACATCCTGATAATGCTATCG 


62.6 




M_ 


16R 


CGAAGCATCCATCTACAGCTA 




17 


M_ 


17F 


TTACAGCTCCCAAGCAAACA 


62.6 




M_ 


1 7R 


GACGGTTTTCTGCGAAATCTA 




18 


M_ 


18R 


GGGAGTTGGCAAGTTAAAGAAG 


62.6 




M_ 


18R 


GGAAGTACGAATTGAAGTGTGG 




19 


M. 


19F 


CCTGGCTAAACTTCCCAATG 


62.6 




M_ 


19R 


AGAAACAGTCGGTGCGAAGT 
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Exonuclease-I and Shrimp Alkaline Phosphatase (Fermentas, 
Life Sciences, Berlington, Ontario, Canada) and DNA sequenc- 
ing was performed in an ABI 3730XL DNA Analyzer (Applied 
Biosystems. Foster City, California, USA), an in-house sequenc- 
ing facility of NIMR. All the 19 PCR-amplified fragments were 
sequenced in both forward and reverse directions (2x coverage) 
and each DNA sequence was individually edited and assem- 
bled using the EditSeq and MegAlign modules of the Lasergene 
(DNASTAR, Madison, Wisconsin, USA) computer program. 
All the 19 finally edited sequences were manually assembled 
to form a single whole mt genome sequence and aligned with 
the 3D7 reference sequence and the published Indian P. falci- 
parum sequence (PfPHlO) [12] using the MEGAv5.0 comput- 
er program [14] to ascertain nucleotide differences, if any, by 
comparing the mt genome sequences of 3 different isolates. Fur- 
ther, to understand the phylogenetic intenelationship among 
the 2 Indian (Blspl andPfPHIO), the reference 3D7, and other 
21 Plasmodium species, Neighbor-joining (NJ) phylogenetic 
tree was constructed using the MEGA v5.0 computer program 
[14] with 100 bootstrap replicates. 

For phylogenetic analysis, the whole mt genome sequences of 
Plasmodium species infecting primates (GenBank no. AB354573, 
AB434919, AB434920, AB354574, AB354572, AY722799, 
AB354575, NC_007232, AB434918, NC_002235), rodents 
(GenBank no. AB379663, AB599931, AB558173), birds (Gen- 
Bank no. AB599930, AB250415, AB302215), Lizard (GenBank 
no. NC_009961) and humans (GenBank no. M76611, NC_ 
007243, AB354570, AB354571, AY282930), were downloaded 
from the NCBI web database (www.ncbi.nlm.nih.gov) and 
aligned using MEGA v5.0 computer program [14]. The whole 
mt genome sequence of the Blspl isolate has been deposited 
in GenBank public domain sequence database with accession 
number KJ144901. 

Using the 19 novel primer-pairs (Table 1) designed in the 
present study, we could successfully sequenced the whole mt 
genome of a single P. falciparum field isolate (Blspl) from an 
endemic locality of India with 2x coverage and compared with 



the whole mt genomes of 2 other isolates (3D7 and the previ- 
ously reported PfPHlO isolate from India) [12]. A detailed list 
of novel primers designed in the present study is provided in 
Table 1, and the approximate locations of each primer-pair in 
the circular mt genome of P. falciparum is presented in Fig. 1. 
While the total length of the whole mt genome from 2 Indian 
P. falciparum isolates (PfPHlO and Blspl) was similar in size of 
5,967 bp (Fig. 1), surprisingly, the alignment of these 2 isolates 
revealed nucleotide differences in 22 positions; 2 of these were 
in coxlll gene, 5 in coxl and, 7 in cytb genes (Table 2), suggest- 
ing very high amount of variation in the mt genome in Indian 
P. falciparum. However, when the presently sequenced mt ge- 
nome from Blspl isolate was aligned with the 3D7 isolate, only 
1 nucleotide difference could be observed between these 2 ge- 
nomes (Table 2). 

This observation was in contrast to an earlier report on high 
sequence variation in mt genome between the worldwide and 
Indian isolates involving the PfPHlO isolate [12]. In any case, 
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Fig. 1 . Schematic overview of the ~6 kb mt genome and the ap- 
proximate locations of primers to amplify the whole mt genome of 
Indian P. falciparum. 



Table 2. Alignment showing variations in the 3 mt [2 Indian (Blspl and PfPHlO) and 1 reference (3D7)] genomes of Plasmodium falci- 
parum isolates 

Nt Positions 208 222 230 510 615 1122 1339 2175 2768 3330 3433 3444 3764 3766 3868 3985 4352 4353 4420 4640 4759 4952 5485 

3D7 a GAGACGATTATTTAACATTATTT 

Blspl b C 

PfPH10 c ATAGTCGCAGACCGTGTAAGCCA 



Nt= nucleotide. 

a Joy et al. (2003). "Present study. °Sharma et al. (2001). 
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the present observation on low mitochondrial genome diver- 
sity between a single P. falciparum isolate (Blspl) and the refer- 
ence 3D7 corroborates earlier opinion on the overall low vari- 
ations among P. falciparum mt genomes [11]. Furthermore, the 
observation of 22 nucleotide changes between just 2 P. falci- 
parum mt genomes from India (Table 2) reflects many-fold 
higher diversity in India, compared to only 30 SNPs found in 
100 worldwide P. falciparum isolates of African, Asian, Papua 
New Guinean, and American origins [11]. Such observed high 
sequence variability between the Blspl and PfPHlO isolates 
might be the reason of failure for PCR amplification with 
primer information used to amplify and sequence the PfPHlO 
isolate [12]. This argument is further justified by the fact that, 
by using the 3D7 isolate (with only 1 nucleotide change), we 
could successfully amplify the Blspl isolate, justifying high se- 
quence similarities. In order to nullify the role of PCR and se- 
quencing enors, we have re-amplified and re-sequenced all the 
19 DNA fragments of Blspl isolate using a different PCR ther- 
mal cycler (total 4x coverage), but the observed results were 
not different from the previously sequenced data. This obser- 
vation essentially means that neither any inaccuracy in the PCR 
nor in sequencing techniques have conmbuted to our observed 
results. 

With the whole mt genome sequence of a second P. falci- 
parum isolate (Blspl) in hand showing high sequence differ- 
ences, we were interested to understand evolutionary interrela- 
tionships between the 2 Indian mt genomes with the reference 
3D7 isolates by constructing NJ phylogenetic tree (Fig. 2). We 
have also included the published whole mt genome sequences 
of different Plasmodium species infecting an array of organisms. 
The tree topologies of the NJ phylogenetic tree justifies the 
evolutionary patterns of Plasmodium species according to their 
respective hosts [15,16]. For the P. falciparum isolates infecting 
humans, the Blspl, 3D7, and C10 (GenBank accession no. 
M76611) form a single clade, whereas the PfPHlO isolate was 
placed away from this clade, justifying high genetic differentia- 
tion of this Indian isolates from the rest of P. falciparum isolates 
(Fig. 2). Whatever the case may be, the high sequence similari- 
ty between the whole mt genome sequences of the Blspl and 
3D7 isolates justify the notion that intra-specific mt genome 
variations are in fart minimal [11] and therefore mt genomes 
remain conserved among the phylum Apicomplexa [8] as well 
as in Plasmodium species infecting different hosts (both hu- 
mans and non-humans) [17], possibly due to very low recom- 
bination rate and uni-parental (maternal) inheritance [6]. 
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Fig. 2. Neighbour-joining (NJ) phylogenetic tree showing evolu- 
tionary inter-relationships among the Blspl, 3D7, and PfPHlO 
isolates and also among different Plasmodium species infecting 
non-human hosts. The values in the internal nodes indicate boot- 
strapped values signifying the strength of each corresponding in- 
ternal node of the NJ tree. 



However, the observed high variability between the mt ge- 
nomes of 2 Indian isolates suggests high genetic diversity in 
Indian P. falciparum [2] which can be further validated by se- 
quencing isolates from more Indian populations. Such study 
would not only fill the gap of the existing knowledge about 
the worldwide mt genome diversity but also help to bring out 
important and so far not-fully-resolved evolutionary history of 
global P. falciparum. Moreover, whole mt genome sequence 
comparisons in multiple P. falciparum isolates from all over In- 
dia would also inform the extent of genetic diversity of the cytb 
gene that is considered to be the target of an effective antima- 
larial, atovaquone [10]. As this antimalarial is currently not 
used in India, the knowledge of the extensive diversity through 
population genomic studies of mt genome of Indian P. falciparum 
would possibly help in deciding whether to incorporate atova- 
quone in malaria control programs in India. 
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